Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 29601 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.7 MiB |
| Average record size in memory | 201.0 B |
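
The two memory figures in this table are linked by simple arithmetic: the average record size is the total size divided by the number of observations. The sketch below checks that relationship using only the values reported above; the variable names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table.
n_observations = 29601
avg_record_bytes = 201.0

# Total size implied by the average record size, in MiB (1 MiB = 1024**2 bytes).
total_bytes = n_observations * avg_record_bytes
total_mib = total_bytes / 1024**2

print(f"{total_mib:.1f} MiB")  # prints "5.7 MiB"
```
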
Variable types
| Numeric | 22 |
|---|---|
| Categorical | 3 |
| Boolean | 1 |
PayStat/Sept05 is highly correlated with PayStat/Aug05 and 4 other fields | High correlation |
PayStat/Aug05 is highly correlated with PayStat/Sept05 and 5 other fields | High correlation |
PayStat/Jul05 is highly correlated with PayStat/Sept05 and 5 other fields | High correlation |
PayStat/Jun05 is highly correlated with PayStat/Sept05 and 5 other fields | High correlation |
PayStat/May05 is highly correlated with PayStat/Sept05 and 5 other fields | High correlation |
PayStat/Apr05 is highly correlated with PayStat/Aug05 and 4 other fields | High correlation |
Outstanding/Sept05 is highly correlated with Outstanding/Aug05 and 4 other fields | High correlation |
Outstanding/Aug05 is highly correlated with Outstanding/Sept05 and 4 other fields | High correlation |
Outstanding/Jul05 is highly correlated with Outstanding/Sept05 and 4 other fields | High correlation |
Outstanding/Jun05 is highly correlated with Outstanding/Sept05 and 4 other fields | High correlation |
Outstanding/May05 is highly correlated with Outstanding/Sept05 and 4 other fields | High correlation |
Outstanding/Apr05 is highly correlated with Outstanding/Sept05 and 4 other fields | High correlation |
PayStats is highly correlated with PayStat/Sept05 and 5 other fields | High correlation |
PayStat/Sept05 is highly correlated with PayStat/Aug05 and 3 other fields | High correlation |
PayStat/Aug05 is highly correlated with PayStat/Sept05 and 8 other fields | High correlation |
PayStat/Jul05 is highly correlated with PayStat/Sept05 and 10 other fields | High correlation |
PayStat/Jun05 is highly correlated with PayStat/Sept05 and 11 other fields | High correlation |
PayStat/May05 is highly correlated with PayStat/Aug05 and 9 other fields | High correlation |
PayStat/Apr05 is highly correlated with PayStat/Aug05 and 9 other fields | High correlation |
Outstanding/Sept05 is highly correlated with PayStat/Aug05 and 9 other fields | High correlation |
Outstanding/Aug05 is highly correlated with PayStat/Aug05 and 11 other fields | High correlation |
Outstanding/Jul05 is highly correlated with PayStat/Aug05 and 12 other fields | High correlation |
Outstanding/Jun05 is highly correlated with PayStat/Jul05 and 14 other fields | High correlation |
Outstanding/May05 is highly correlated with PayStat/Jul05 and 14 other fields | High correlation |
Outstanding/Apr05 is highly correlated with PayStat/Jun05 and 12 other fields | High correlation |
Paid/Sept05 is highly correlated with Outstanding/Sept05 and 5 other fields | High correlation |
Paid/Aug05 is highly correlated with Outstanding/Jul05 and 5 other fields | High correlation |
Paid/Jul05 is highly correlated with Outstanding/Jun05 and 7 other fields | High correlation |
Paid/Jun05 is highly correlated with Outstanding/Jun05 and 6 other fields | High correlation |
Paid/May05 is highly correlated with Outstanding/Jun05 and 5 other fields | High correlation |
Paid/Apr05 is highly correlated with Outstanding/May05 and 4 other fields | High correlation |
PayStats is highly correlated with PayStat/Sept05 and 11 other fields | High correlation |
Unnamed: 0 is highly correlated with Default | High correlation |
Credit Limit is highly correlated with PayStat/Aug05 and 5 other fields | High correlation |
Age is highly correlated with Default | High correlation |
PayStat/Sept05 is highly correlated with Default | High correlation |
PayStat/Aug05 is highly correlated with Credit Limit and 1 other fields | High correlation |
PayStat/Jul05 is highly correlated with Credit Limit and 1 other fields | High correlation |
PayStat/Jun05 is highly correlated with Credit Limit and 1 other fields | High correlation |
PayStat/May05 is highly correlated with Credit Limit and 1 other fields | High correlation |
PayStat/Apr05 is highly correlated with Credit Limit and 1 other fields | High correlation |
Outstanding/Sept05 is highly correlated with Outstanding/Aug05 and 5 other fields | High correlation |
Outstanding/Aug05 is highly correlated with Outstanding/Sept05 and 5 other fields | High correlation |
Outstanding/Jul05 is highly correlated with Outstanding/Sept05 and 5 other fields | High correlation |
Outstanding/Jun05 is highly correlated with Outstanding/Sept05 and 5 other fields | High correlation |
Outstanding/May05 is highly correlated with Outstanding/Sept05 and 5 other fields | High correlation |
Outstanding/Apr05 is highly correlated with Outstanding/Sept05 and 5 other fields | High correlation |
Paid/Sept05 is highly correlated with Default | High correlation |
Paid/Aug05 is highly correlated with Default | High correlation |
Paid/Jul05 is highly correlated with Default | High correlation |
Paid/Jun05 is highly correlated with Default | High correlation |
Paid/May05 is highly correlated with Default | High correlation |
Paid/Apr05 is highly correlated with Default | High correlation |
Default is highly correlated with Unnamed: 0 and 20 other fields | High correlation |
Marital Status is highly correlated with Age | High correlation |
Credit Limit is highly correlated with Outstanding/Aug05 and 5 other fields | High correlation |
Outstanding/Aug05 is highly correlated with Credit Limit and 6 other fields | High correlation |
Paid/May05 is highly correlated with Paid/Aug05 and 1 other fields | High correlation |
PayStat/Jun05 is highly correlated with PayStat/May05 and 5 other fields | High correlation |
PayStat/May05 is highly correlated with PayStat/Jun05 and 6 other fields | High correlation |
Outstanding/Jun05 is highly correlated with Credit Limit and 6 other fields | High correlation |
Outstanding/May05 is highly correlated with Credit Limit and 8 other fields | High correlation |
Paid/Sept05 is highly correlated with Paid/Aug05 and 2 other fields | High correlation |
Paid/Aug05 is highly correlated with Paid/May05 and 3 other fields | High correlation |
PayStat/Jul05 is highly correlated with PayStat/Jun05 and 5 other fields | High correlation |
Paid/Jun05 is highly correlated with Paid/Sept05 and 1 other fields | High correlation |
PayStat/Apr05 is highly correlated with PayStat/Jun05 and 6 other fields | High correlation |
Age is highly correlated with Marital Status | High correlation |
PayStats is highly correlated with PayStat/Jun05 and 5 other fields | High correlation |
PayStat/Aug05 is highly correlated with PayStat/Jun05 and 5 other fields | High correlation |
Default is highly correlated with PayStat/Sept05 | High correlation |
PayStat/Sept05 is highly correlated with PayStat/Jun05 and 6 other fields | High correlation |
Outstanding/Sept05 is highly correlated with Credit Limit and 6 other fields | High correlation |
Outstanding/Apr05 is highly correlated with Credit Limit and 6 other fields | High correlation |
Outstanding/Jul05 is highly correlated with Outstanding/Aug05 and 6 other fields | High correlation |
Paid/Jul05 is highly correlated with Credit Limit and 8 other fields | High correlation |
Paid/Aug05 is highly skewed (γ1 = 30.62926199) | Skewed |
Unnamed: 0 is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
PayStat/Sept05 has 14499 (49.0%) zeros | Zeros |
PayStat/Aug05 has 15476 (52.3%) zeros | Zeros |
PayStat/Jul05 has 15518 (52.4%) zeros | Zeros |
PayStat/Jun05 has 16204 (54.7%) zeros | Zeros |
PayStat/May05 has 16684 (56.4%) zeros | Zeros |
PayStat/Apr05 has 16053 (54.2%) zeros | Zeros |
Outstanding/Sept05 has 1981 (6.7%) zeros | Zeros |
Outstanding/Aug05 has 2466 (8.3%) zeros | Zeros |
Outstanding/Jul05 has 2826 (9.5%) zeros | Zeros |
Outstanding/Jun05 has 3143 (10.6%) zeros | Zeros |
Outstanding/May05 has 3433 (11.6%) zeros | Zeros |
Outstanding/Apr05 has 3929 (13.3%) zeros | Zeros |
Paid/Sept05 has 5192 (17.5%) zeros | Zeros |
Paid/Aug05 has 5334 (18.0%) zeros | Zeros |
Paid/Jul05 has 5891 (19.9%) zeros | Zeros |
Paid/Jun05 has 6318 (21.3%) zeros | Zeros |
Paid/May05 has 6600 (22.3%) zeros | Zeros |
Paid/Apr05 has 7043 (23.8%) zeros | Zeros |
PayStats has 2470 (8.3%) zeros | Zeros |
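
The "High correlation" alerts above flag pairs of variables whose correlation coefficient exceeds some threshold. The report does not state which coefficient or threshold produced each alert, and the same variable appears several times, possibly because several correlation measures were evaluated. The following is a minimal sketch of the idea using Pearson correlation only; the 0.9 threshold, the function names, and the toy data are assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def high_correlation_pairs(columns, threshold=0.9):
    """Return (name_a, name_b, r) for every column pair with |r| >= threshold."""
    flagged = []
    for (name_a, xs), (name_b, ys) in combinations(columns.items(), 2):
        r = pearson(xs, ys)
        if abs(r) >= threshold:
            flagged.append((name_a, name_b, round(r, 3)))
    return flagged


# Toy data, NOT taken from the profiled dataset.
toy = {
    "outstanding_sep": [100, 200, 300, 400, 500],
    "outstanding_aug": [110, 190, 310, 390, 520],
    "age": [30, 25, 41, 23, 37],
}
pairs = high_correlation_pairs(toy)
print(pairs)
```

On the toy data only the two consecutive-month balance columns are flagged, which mirrors the pattern in the alerts: adjacent monthly `Outstanding/*` and `PayStat/*` fields tend to move together.
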
Reproduction
| Analysis started | 2021-07-30 01:01:00.299140 |
|---|---|
| Analysis finished | 2021-07-30 01:05:32.403759 |
| Duration | 4 minutes and 32.1 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
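
The reported duration follows directly from the two timestamps. A short check with the standard library, using the timestamps exactly as printed in the table:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2021-07-30 01:01:00.299140", FMT)
finished = datetime.strptime("2021-07-30 01:05:32.403759", FMT)

# Split the elapsed time into whole minutes and remaining seconds.
elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)

print(f"{int(minutes)} minutes and {seconds:.1f} seconds")  # prints "4 minutes and 32.1 seconds"
```
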
Unnamed: 0
| Distinct | 29601 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14971.75893 |
| Minimum | 1 |
|---|---|
| Maximum | 30000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1495 |
| Q1 | 7474 |
| median | 14953 |
| Q3 | 22463 |
| 95-th percentile | 28488 |
| Maximum | 30000 |
| Range | 29999 |
| Interquartile range (IQR) | 14989 |
Descriptive statistics
| Standard deviation | 8660.18443 |
|---|---|
| Coefficient of variation (CV) | 0.5784346697 |
| Kurtosis | -1.199491067 |
| Mean | 14971.75893 |
| Median Absolute Deviation (MAD) | 7494 |
| Skewness | 0.004481755181 |
| Sum | 443179036 |
| Variance | 74998794.35 |
| Monotonicity | Strictly increasing |
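
Two of the descriptive statistics above are derived from the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below reproduces both from the rounded values in the table, so small differences in the last digits are expected.

```python
# Values copied from the "Descriptive statistics" table.
std = 8660.18443
mean = 14971.75893

cv = std / mean        # coefficient of variation, about 0.5784
variance = std ** 2    # about 74998794.4

print(round(cv, 4), round(variance, 1))
```
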
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 19965 | 1 | < 0.1% |
| 19963 | 1 | < 0.1% |
| 19962 | 1 | < 0.1% |
| 19961 | 1 | < 0.1% |
| 19960 | 1 | < 0.1% |
| 19959 | 1 | < 0.1% |
| 19958 | 1 | < 0.1% |
| 19957 | 1 | < 0.1% |
| 19956 | 1 | < 0.1% |
| Other values (29591) | 29591 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 30000 | 1 | |
| 29999 | 1 | |
| 29998 | 1 | |
| 29997 | 1 | |
| 29996 | 1 | |
| 29995 | 1 | |
| 29994 | 1 | |
| 29993 | 1 | |
| 29992 | 1 | |
| 29991 | 1 |
Credit Limit
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167550.5449 |
| Minimum | 10000 |
|---|---|
| Maximum | 1000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 20000 |
| Q1 | 50000 |
| median | 140000 |
| Q3 | 240000 |
| 95-th percentile | 430000 |
| Maximum | 1000000 |
| Range | 990000 |
| Interquartile range (IQR) | 190000 |
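
The range and interquartile range in this table are simple differences of the quantiles listed above them. The sketch below recomputes both and, as an additional illustration that is not part of the report, derives the conventional Tukey fences (1.5 × IQR beyond the quartiles) that are often used to mark potential outliers.

```python
# Values copied from the "Quantile statistics" table.
minimum, maximum = 10000, 1000000
q1, q3 = 50000, 240000

value_range = maximum - minimum   # 990000, matches "Range"
iqr = q3 - q1                     # 190000, matches "Interquartile range (IQR)"

# Tukey fences: an assumed convention for illustration, not used by the report.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, lower_fence, upper_fence)
```

Under this convention any value above 525000 would be treated as a candidate outlier; the lower fence is negative and therefore never reached by this variable.
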
Descriptive statistics
| Standard deviation | 129944.021 |
|---|---|
| Coefficient of variation (CV) | 0.7755511689 |
| Kurtosis | 0.5318491585 |
| Mean | 167550.5449 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 0.9928498687 |
| Sum | 4959663680 |
| Variance | 1.688544858 × 10¹⁰ |
| Monotonicity | Not monotonic |
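
The Median Absolute Deviation (MAD) in this table is a robust measure of spread: the median of the absolute distances from the median. A minimal sketch with the standard library follows; the function name and the sample values are hypothetical and are not rows from the dataset.

```python
from statistics import median


def median_absolute_deviation(values):
    """Median of the absolute deviations from the median."""
    center = median(values)
    return median(abs(v - center) for v in values)


# Hypothetical values for illustration only.
sample = [10000, 50000, 140000, 240000, 1000000]
mad = median_absolute_deviation(sample)
print(mad)
```

Unlike the standard deviation, the MAD is barely affected by the single very large value in the sample.
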
| Value | Count | Frequency (%) |
| 50000 | 3331 | 11.3% |
| 20000 | 1955 | 6.6% |
| 30000 | 1586 | 5.4% |
| 80000 | 1542 | 5.2% |
| 200000 | 1498 | 5.1% |
| 150000 | 1080 | 3.6% |
| 100000 | 1035 | 3.5% |
| 180000 | 979 | 3.3% |
| 360000 | 872 | 2.9% |
| 60000 | 819 | 2.8% |
| Other values (71) | 14904 |
| Value | Count | Frequency (%) |
| 10000 | 486 | 1.6% |
| 16000 | 1 | < 0.1% |
| 20000 | 1955 | 6.6% |
| 30000 | 1586 | 5.4% |
| 40000 | 226 | 0.8% |
| 50000 | 3331 | 11.3% |
| 60000 | 819 | 2.8% |
| 70000 | 726 | 2.5% |
| 80000 | 1542 | 5.2% |
| 90000 | 641 | 2.2% |
| Value | Count | Frequency (%) |
| 1000000 | 1 | < 0.1% |
| 800000 | 2 | < 0.1% |
| 780000 | 2 | < 0.1% |
| 760000 | 1 | < 0.1% |
| 750000 | 4 | |
| 740000 | 2 | < 0.1% |
| 730000 | 2 | < 0.1% |
| 720000 | 3 | < 0.1% |
| 710000 | 6 | |
| 700000 | 8 |
Sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 231.4 KiB |
Category frequency chart (image not preserved): F, M
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29601 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| F | 17855 | 60.3% |
| M | 11746 | 39.7% |
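
The frequency column of a "Common Values" table is each count divided by the total number of observations. A short sketch using the counts from this table (the dictionary name is illustrative):

```python
# Counts copied from the "Common Values" table for Sex.
counts = {"F": 17855, "M": 11746}

total = sum(counts.values())
shares = {value: round(100 * n / total, 1) for value, n in counts.items()}

print(total, shares)
```

The total equals the 29601 observations reported in the dataset statistics, which confirms that the variable has no missing values.
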
Words
| Value | Count | Frequency (%) |
| f | 17855 | |
| m | 11746 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 17855 | |
| M | 11746 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 29601 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 17855 | |
| M | 11746 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29601 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 17855 | |
| M | 11746 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29601 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 17855 | |
| M | 11746 |
Education
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 231.4 KiB |
Category frequency chart (image not preserved): BSc, MSc or PHd, High School Diploma, Other (123)
Length
| Max length | 19 |
|---|---|
| Median length | 10 |
| Mean length | 8.144454579 |
| Min length | 3 |
Characters and Unicode
| Total characters | 241084 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
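
The character totals for a categorical variable can be reconstructed from its value counts: each category contributes its string length times its count. The sketch below does this for Education using the counts from the "Common Values" table; the variable names are illustrative.

```python
# Counts copied from the Education "Common Values" table.
counts = {
    "BSc": 14024,
    "MSc or PHd": 10581,
    "High School Diploma": 4873,
    "Other": 123,
}

n_rows = sum(counts.values())
total_chars = sum(len(value) * n for value, n in counts.items())
total_spaces = sum(value.count(" ") * n for value, n in counts.items())
mean_length = total_chars / n_rows

print(n_rows, total_chars, total_spaces, round(mean_length, 4))
```

The results agree with the report: 241084 total characters, 30908 space characters, and a mean length of about 8.1445.
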
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BSc |
|---|---|
| 2nd row | BSc |
| 3rd row | BSc |
| 4th row | BSc |
| 5th row | BSc |
Common Values
| Value | Count | Frequency (%) |
| BSc | 14024 | 47.4% |
| MSc or PHd | 10581 | 35.7% |
| High School Diploma | 4873 | 16.5% |
| Other | 123 | 0.4% |
Words
| Value | Count | Frequency (%) |
| bsc | 14024 | |
| msc | 10581 | |
| or | 10581 | |
| phd | 10581 | |
| high | 4873 | 8.1% |
| school | 4873 | 8.1% |
| diploma | 4873 | 8.1% |
| other | 123 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 30908 | 12.8% |
| S | 29478 | |
| c | 29478 | |
| o | 25200 | |
| H | 15454 | 6.4% |
| B | 14024 | 5.8% |
| r | 10704 | 4.4% |
| M | 10581 | 4.4% |
| P | 10581 | 4.4% |
| d | 10581 | 4.4% |
| Other values (11) | 54095 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125062 | |
| Uppercase Letter | 85114 | |
| Space Separator | 30908 | 12.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 29478 | |
| o | 25200 | |
| r | 10704 | 8.6% |
| d | 10581 | 8.5% |
| h | 9869 | 7.9% |
| i | 9746 | 7.8% |
| l | 9746 | 7.8% |
| g | 4873 | 3.9% |
| p | 4873 | 3.9% |
| m | 4873 | 3.9% |
| Other values (3) | 5119 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 29478 | |
| H | 15454 | |
| B | 14024 | |
| M | 10581 | 12.4% |
| P | 10581 | 12.4% |
| D | 4873 | 5.7% |
| O | 123 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 30908 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 210176 | |
| Common | 30908 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 29478 | |
| c | 29478 | |
| o | 25200 | |
| H | 15454 | 7.4% |
| B | 14024 | 6.7% |
| r | 10704 | 5.1% |
| M | 10581 | 5.0% |
| P | 10581 | 5.0% |
| d | 10581 | 5.0% |
| h | 9869 | 4.7% |
| Other values (10) | 44226 |
Common
| Value | Count | Frequency (%) |
| (space) | 30908 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 241084 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 30908 | 12.8% |
| S | 29478 | |
| c | 29478 | |
| o | 25200 | |
| H | 15454 | 6.4% |
| B | 14024 | 5.8% |
| r | 10704 | 4.4% |
| M | 10581 | 4.4% |
| P | 10581 | 4.4% |
| d | 10581 | 4.4% |
| Other values (11) | 54095 |
Marital Status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 231.4 KiB |
Category frequency chart (image not preserved): Single, Married, Other (318)
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.444545792 |
| Min length | 5 |
Characters and Unicode
| Total characters | 190765 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married |
|---|---|
| 2nd row | Single |
| 3rd row | Single |
| 4th row | Married |
| 5th row | Married |
Common Values
| Value | Count | Frequency (%) |
| Single | 15806 | 53.4% |
| Married | 13477 | 45.5% |
| Other | 318 | 1.1% |
Words
| Value | Count | Frequency (%) |
| single | 15806 | |
| married | 13477 | |
| other | 318 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 29601 | |
| i | 29283 | |
| r | 27272 | |
| S | 15806 | |
| n | 15806 | |
| g | 15806 | |
| l | 15806 | |
| M | 13477 | |
| a | 13477 | |
| d | 13477 | |
| Other values (3) | 954 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161164 | |
| Uppercase Letter | 29601 | 15.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 29601 | |
| i | 29283 | |
| r | 27272 | |
| n | 15806 | |
| g | 15806 | |
| l | 15806 | |
| a | 13477 | |
| d | 13477 | |
| t | 318 | 0.2% |
| h | 318 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 15806 | |
| M | 13477 | |
| O | 318 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 190765 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 29601 | |
| i | 29283 | |
| r | 27272 | |
| S | 15806 | |
| n | 15806 | |
| g | 15806 | |
| l | 15806 | |
| M | 13477 | |
| a | 13477 | |
| d | 13477 | |
| Other values (3) | 954 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 190765 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 29601 | |
| i | 29283 | |
| r | 27272 | |
| S | 15806 | |
| n | 15806 | |
| g | 15806 | |
| l | 15806 | |
| M | 13477 | |
| a | 13477 | |
| d | 13477 | |
| Other values (3) | 954 | 0.5% |
Age
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.46407216 |
| Minimum | 21 |
|---|---|
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 28 |
| median | 34 |
| Q3 | 41 |
| 95-th percentile | 53 |
| Maximum | 79 |
| Range | 58 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.213243334 |
|---|---|
| Coefficient of variation (CV) | 0.2597909031 |
| Kurtosis | 0.05551028991 |
| Mean | 35.46407216 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.7373094047 |
| Sum | 1049772 |
| Variance | 84.88385273 |
| Monotonicity | Not monotonic |
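
Skewness and kurtosis summarize the shape of the distribution. The positive skewness reported here (about 0.74) is consistent with a longer right tail, and the mean (35.46) sitting above the median (34) points the same way. The sketch below uses the plain moment-based definitions with excess kurtosis (normal distribution = 0). This is an assumption for illustration: the report's estimator may apply small-sample bias corrections, so its values can differ slightly. The function names and sample lists are hypothetical.

```python
def central_moment(values, k):
    """k-th central moment (population form, divides by n)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** k for v in values) / len(values)


def skewness(values):
    """Moment-based skewness: m3 / m2**1.5."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5


def excess_kurtosis(values):
    """Moment-based excess kurtosis: m4 / m2**2 - 3."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3


symmetric = [1, 2, 3, 4, 5]        # no tail on either side
right_tailed = [1, 1, 1, 2, 10]    # one large value stretches the right tail

print(skewness(symmetric), excess_kurtosis(symmetric))
print(skewness(right_tailed))
```
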
| Value | Count | Frequency (%) |
| 29 | 1593 | 5.4% |
| 27 | 1455 | 4.9% |
| 28 | 1397 | 4.7% |
| 30 | 1382 | 4.7% |
| 26 | 1245 | 4.2% |
| 31 | 1205 | 4.1% |
| 25 | 1176 | 4.0% |
| 34 | 1147 | 3.9% |
| 32 | 1143 | 3.9% |
| 33 | 1127 | 3.8% |
| Other values (46) | 16731 |
| Value | Count | Frequency (%) |
| 21 | 64 | 0.2% |
| 22 | 553 | 1.9% |
| 23 | 917 | 3.1% |
| 24 | 1117 | 3.8% |
| 25 | 1176 | 4.0% |
| 26 | 1245 | 4.2% |
| 27 | 1455 | 4.9% |
| 28 | 1397 | 4.7% |
| 29 | 1593 | 5.4% |
| 30 | 1382 | 4.7% |
| Value | Count | Frequency (%) |
| 79 | 1 | < 0.1% |
| 75 | 3 | < 0.1% |
| 74 | 1 | < 0.1% |
| 73 | 4 | < 0.1% |
| 72 | 3 | < 0.1% |
| 71 | 3 | < 0.1% |
| 70 | 10 | |
| 69 | 15 | |
| 68 | 5 | < 0.1% |
| 67 | 16 |
PayStat/Sept05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.01493192798 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 14499 |
| Zeros (%) | 49.0% |
| Negative | 8341 |
| Negative (%) | 28.2% |
| Memory size | 231.4 KiB |
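
The "Zeros" and "Negative" percentages are the corresponding counts divided by the number of observations, and the remainder gives the number of strictly positive values. A quick check using the figures from this table and the 29601 observations reported in the dataset statistics (variable names are illustrative):

```python
n_rows = 29601
zeros, negatives = 14499, 8341   # copied from the table above

zeros_pct = round(100 * zeros / n_rows, 1)
negatives_pct = round(100 * negatives / n_rows, 1)

# Whatever is neither zero nor negative must be strictly positive.
positives = n_rows - zeros - negatives

print(zeros_pct, negatives_pct, positives)
```

The 6761 positive values match the sum of the counts for values 1 through 8 in the frequency tables for this variable.
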
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.124502894 |
|---|---|
| Coefficient of variation (CV) | -75.30862032 |
| Kurtosis | 2.720747276 |
| Mean | -0.01493192798 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7331730188 |
| Sum | -442 |
| Variance | 1.26450676 |
| Monotonicity | Not monotonic |
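
The large negative coefficient of variation above is an artifact of the mean being close to zero rather than a sign of extreme dispersion. The sketch below recomputes the mean from the reported sum and the CV from the reported standard deviation; the variable names are illustrative.

```python
n_rows = 29601
total = -442            # "Sum" from the table above
std = 1.124502894       # "Standard deviation" from the table above

mean = total / n_rows   # about -0.0149
cv = std / mean         # about -75.3: dividing by a near-zero mean inflates the ratio

print(mean, cv)
```

For variables like this one, whose values straddle zero, the standard deviation or the IQR is a more meaningful measure of spread than the CV.
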
| Value | Count | Frequency (%) |
| 0 | 14499 | 49.0% |
| -1 | 5633 | 19.0% |
| 1 | 3662 | 12.4% |
| -2 | 2708 | 9.1% |
| 2 | 2640 | 8.9% |
| 3 | 320 | 1.1% |
| 4 | 76 | 0.3% |
| 5 | 24 | 0.1% |
| 8 | 19 | 0.1% |
| 6 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 2708 | 9.1% |
| -1 | 5633 | 19.0% |
| 0 | 14499 | 49.0% |
| 1 | 3662 | 12.4% |
| 2 | 2640 | 8.9% |
| 3 | 320 | 1.1% |
| 4 | 76 | 0.3% |
| 5 | 24 | 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 19 | 0.1% |
| 7 | 9 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 24 | 0.1% |
| 4 | 76 | 0.3% |
| 3 | 320 | 1.1% |
| 2 | 2640 | 8.9% |
| 1 | 3662 | 12.4% |
| 0 | 14499 | 49.0% |
| -1 | 5633 | 19.0% |
PayStat/Aug05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1313131313 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 15476 |
| Zeros (%) | 52.3% |
| Negative | 9712 |
| Negative (%) | 32.8% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.199642203 |
|---|---|
| Coefficient of variation (CV) | -9.135736773 |
| Kurtosis | 1.557982384 |
| Mean | -0.1313131313 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.791420645 |
| Sum | -3887 |
| Variance | 1.439141414 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 15476 | 52.3% |
| -1 | 5990 | 20.2% |
| 2 | 3904 | 13.2% |
| -2 | 3722 | 12.6% |
| 3 | 326 | 1.1% |
| 4 | 97 | 0.3% |
| 1 | 28 | 0.1% |
| 5 | 25 | 0.1% |
| 7 | 20 | 0.1% |
| 6 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 3722 | 12.6% |
| -1 | 5990 | 20.2% |
| 0 | 15476 | 52.3% |
| 1 | 28 | 0.1% |
| 2 | 3904 | 13.2% |
| 3 | 326 | 1.1% |
| 4 | 97 | 0.3% |
| 5 | 25 | 0.1% |
| 6 | 12 | < 0.1% |
| 7 | 20 | 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 20 | 0.1% |
| 6 | 12 | < 0.1% |
| 5 | 25 | 0.1% |
| 4 | 97 | 0.3% |
| 3 | 326 | 1.1% |
| 2 | 3904 | 13.2% |
| 1 | 28 | 0.1% |
| 0 | 15476 | 52.3% |
| -1 | 5990 | 20.2% |
PayStat/Jul05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1634404243 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 15518 |
| Zeros (%) | 52.4% |
| Negative | 9890 |
| Negative (%) | 33.4% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.199792708 |
|---|---|
| Coefficient of variation (CV) | -7.34085654 |
| Kurtosis | 2.072829638 |
| Mean | -0.1634404243 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.841479725 |
| Sum | -4838 |
| Variance | 1.439502541 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 15518 | 52.4% |
| -1 | 5863 | 19.8% |
| -2 | 4027 | 13.6% |
| 2 | 3802 | 12.8% |
| 3 | 237 | 0.8% |
| 4 | 76 | 0.3% |
| 7 | 27 | 0.1% |
| 6 | 23 | 0.1% |
| 5 | 21 | 0.1% |
| 1 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4027 | 13.6% |
| -1 | 5863 | 19.8% |
| 0 | 15518 | 52.4% |
| 1 | 4 | < 0.1% |
| 2 | 3802 | 12.8% |
| 3 | 237 | 0.8% |
| 4 | 76 | 0.3% |
| 5 | 21 | 0.1% |
| 6 | 23 | 0.1% |
| 7 | 27 | 0.1% |
| Value | Count | Frequency (%) |
| 8 | 3 | < 0.1% |
| 7 | 27 | 0.1% |
| 6 | 23 | 0.1% |
| 5 | 21 | 0.1% |
| 4 | 76 | 0.3% |
| 3 | 237 | 0.8% |
| 2 | 3802 | 12.8% |
| 1 | 4 | < 0.1% |
| 0 | 15518 | 52.4% |
| -1 | 5863 | 19.8% |
PayStat/Jun05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2183034357 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 16204 |
| Zeros (%) | 54.7% |
| Negative | 9904 |
| Negative (%) | 33.5% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.172219586 |
|---|---|
| Coefficient of variation (CV) | -5.369679968 |
| Kurtosis | 3.488694804 |
| Mean | -0.2183034357 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.003030728 |
| Sum | -6462 |
| Variance | 1.374098757 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16204 | 54.7% |
| -1 | 5617 | 19.0% |
| -2 | 4287 | 14.5% |
| 2 | 3142 | 10.6% |
| 3 | 180 | 0.6% |
| 4 | 69 | 0.2% |
| 7 | 58 | 0.2% |
| 5 | 35 | 0.1% |
| 6 | 5 | < 0.1% |
| 1 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4287 | 14.5% |
| -1 | 5617 | 19.0% |
| 0 | 16204 | 54.7% |
| 1 | 2 | < 0.1% |
| 2 | 3142 | 10.6% |
| 3 | 180 | 0.6% |
| 4 | 69 | 0.2% |
| 5 | 35 | 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 58 | 0.2% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 58 | 0.2% |
| 6 | 5 | < 0.1% |
| 5 | 35 | 0.1% |
| 4 | 69 | 0.2% |
| 3 | 180 | 0.6% |
| 2 | 3142 | 10.6% |
| 1 | 2 | < 0.1% |
| 0 | 16204 | 54.7% |
| -1 | 5617 | 19.0% |
PayStat/May05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2639775683 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 16684 |
| Zeros (%) | 56.4% |
| Negative | 9959 |
| Negative (%) | 33.6% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.136217041 |
|---|---|
| Coefficient of variation (CV) | -4.304218152 |
| Kurtosis | 3.981011416 |
| Mean | -0.2639775683 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.013018627 |
| Sum | -7814 |
| Variance | 1.290989165 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16684 | 56.4% |
| -1 | 5480 | 18.5% |
| -2 | 4479 | 15.1% |
| 2 | 2617 | 8.8% |
| 3 | 177 | 0.6% |
| 4 | 84 | 0.3% |
| 7 | 58 | 0.2% |
| 5 | 17 | 0.1% |
| 6 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4479 | 15.1% |
| -1 | 5480 | 18.5% |
| 0 | 16684 | 56.4% |
| 2 | 2617 | 8.8% |
| 3 | 177 | 0.6% |
| 4 | 84 | 0.3% |
| 5 | 17 | 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 58 | 0.2% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 58 | 0.2% |
| 6 | 4 | < 0.1% |
| 5 | 17 | 0.1% |
| 4 | 84 | 0.3% |
| 3 | 177 | 0.6% |
| 2 | 2617 | 8.8% |
| 0 | 16684 | 56.4% |
| -1 | 5480 | 18.5% |
| -2 | 4479 | 15.1% |
PayStat/Apr05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2875578528 |
| Minimum | -2 |
|---|---|
| Maximum | 8 |
| Zeros | 16053 |
| Zeros (%) | 54.2% |
| Negative | 10480 |
| Negative (%) | 35.4% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.152205693 |
|---|---|
| Coefficient of variation (CV) | -4.006865684 |
| Kurtosis | 3.428958041 |
| Mean | -0.2875578528 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9528070424 |
| Sum | -8512 |
| Variance | 1.327577958 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16053 | 54.2% |
| -1 | 5674 | 19.2% |
| -2 | 4806 | 16.2% |
| 2 | 2756 | 9.3% |
| 3 | 183 | 0.6% |
| 4 | 49 | 0.2% |
| 7 | 46 | 0.2% |
| 6 | 19 | 0.1% |
| 5 | 13 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| -2 | 4806 | 16.2% |
| -1 | 5674 | 19.2% |
| 0 | 16053 | 54.2% |
| 2 | 2756 | 9.3% |
| 3 | 183 | 0.6% |
| 4 | 49 | 0.2% |
| 5 | 13 | < 0.1% |
| 6 | 19 | 0.1% |
| 7 | 46 | 0.2% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 46 | 0.2% |
| 6 | 19 | 0.1% |
| 5 | 13 | < 0.1% |
| 4 | 49 | 0.2% |
| 3 | 183 | 0.6% |
| 2 | 2756 | 9.3% |
| 0 | 16053 | 54.2% |
| -1 | 5674 | 19.2% |
| -2 | 4806 | 16.2% |
Outstanding/Sept05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 22441 |
|---|---|
| Distinct (%) | 75.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50957.43201 |
| Minimum | -165580 |
|---|---|
| Maximum | 964511 |
| Zeros | 1981 |
| Zeros (%) | 6.7% |
| Negative | 588 |
| Negative (%) | 2.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -165580 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3528 |
| median | 22259 |
| Q3 | 66623 |
| 95-th percentile | 200545 |
| Maximum | 964511 |
| Range | 1130091 |
| Interquartile range (IQR) | 63095 |
Descriptive statistics
| Standard deviation | 73370.2424 |
|---|---|
| Coefficient of variation (CV) | 1.439833985 |
| Kurtosis | 9.883084664 |
| Mean | 50957.43201 |
| Median Absolute Deviation (MAD) | 21680 |
| Skewness | 2.673879937 |
| Sum | 1508390945 |
| Variance | 5383192470 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1981 | 6.7% |
| 390 | 243 | 0.8% |
| 780 | 74 | 0.2% |
| 326 | 70 | 0.2% |
| 316 | 63 | 0.2% |
| 2500 | 59 | 0.2% |
| 396 | 49 | 0.2% |
| 2400 | 39 | 0.1% |
| 416 | 29 | 0.1% |
| 1050 | 25 | 0.1% |
| Other values (22431) | 26969 |
| Value | Count | Frequency (%) |
| -165580 | 1 | |
| -154973 | 1 | |
| -15308 | 1 | |
| -14386 | 1 | |
| -11545 | 1 | |
| -10682 | 1 | |
| -9802 | 1 | |
| -9095 | 1 | |
| -8187 | 1 | |
| -7438 | 1 |
| Value | Count | Frequency (%) |
| 964511 | 1 | |
| 746814 | 1 | |
| 653062 | 1 | |
| 630458 | 1 | |
| 621749 | 1 | |
| 613860 | 1 | |
| 610723 | 1 | |
| 608594 | 1 | |
| 604019 | 1 | |
| 589654 | 1 |
Outstanding/Aug05
Real number (ℝ)
HIGH CORRELATION, ZEROS
| Distinct | 22069 |
|---|---|
| Distinct (%) | 74.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48942.18955 |
| Minimum | -69777 |
|---|---|
| Maximum | 983931 |
| Zeros | 2466 |
| Zeros (%) | 8.3% |
| Negative | 665 |
| Negative (%) | 2.2% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -69777 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2970 |
| median | 21050 |
| Q3 | 63497 |
| 95-th percentile | 194111 |
| Maximum | 983931 |
| Range | 1053708 |
| Interquartile range (IQR) | 60527 |
Descriptive statistics
| Standard deviation | 70923.98515 |
|---|---|
| Coefficient of variation (CV) | 1.449137969 |
| Kurtosis | 10.40862349 |
| Mean | 48942.18955 |
| Median Absolute Deviation (MAD) | 20660 |
| Skewness | 2.71669497 |
| Sum | 1448737753 |
| Variance | 5030211670 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2466 | 8.3% |
| 390 | 230 | 0.8% |
| 780 | 75 | 0.3% |
| 326 | 74 | 0.2% |
| 316 | 72 | 0.2% |
| 396 | 51 | 0.2% |
| 2500 | 51 | 0.2% |
| 2400 | 42 | 0.1% |
| -200 | 29 | 0.1% |
| 416 | 28 | 0.1% |
| Other values (22059) | 26483 |
| Value | Count | Frequency (%) |
|---|---|---|
| -69777 | 1 | |
| -67526 | 1 | |
| -33350 | 1 | |
| -30000 | 1 | |
| -26214 | 1 | |
| -24704 | 1 | |
| -24702 | 1 | |
| -22960 | 1 | |
| -18618 | 1 | |
| -18088 | 1 |
| Value | Count | Frequency (%) |
|---|---|---|
| 983931 | 1 | |
| 743970 | 1 | |
| 671563 | 1 | |
| 646770 | 1 | |
| 624475 | 1 | |
| 605943 | 1 | |
| 597793 | 1 | |
| 581775 | 1 | |
| 577681 | 1 | |
| 572834 | 1 |
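The derived figures reported for Outstanding/Aug05 can be cross-checked against the primary ones with a little arithmetic. The sketch below assumes the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, mean = sum / observations, CV = standard deviation / mean, variance = standard deviation squared). The dictionary keys and variable names are illustrative, and the observation count of 29601 is taken from the dataset overview.

```python
import math

N_OBSERVATIONS = 29601  # number of observations, from the dataset overview

# Figures copied from the Outstanding/Aug05 tables; the keys are illustrative names.
reported = {
    "minimum": -69777, "maximum": 983931, "range": 1053708,
    "q1": 2970, "q3": 63497, "iqr": 60527,
    "sum": 1448737753, "mean": 48942.18955,
    "std": 70923.98515, "variance": 5030211670, "cv": 1.449137969,
    "zeros": 2466, "zeros_pct": 8.3,
    "negative": 665, "negative_pct": 2.2,
}

# Recompute each derived figure from the primary ones (assumed standard definitions).
derived = {
    "range": reported["maximum"] - reported["minimum"],
    "iqr": reported["q3"] - reported["q1"],
    "mean": reported["sum"] / N_OBSERVATIONS,
    "variance": reported["std"] ** 2,
    "cv": reported["std"] / reported["mean"],
    "zeros_pct": round(100 * reported["zeros"] / N_OBSERVATIONS, 1),
    "negative_pct": round(100 * reported["negative"] / N_OBSERVATIONS, 1),
}

for name, value in derived.items():
    # A loose relative tolerance absorbs the rounding of the reported figures.
    status = "ok" if math.isclose(value, reported[name], rel_tol=1e-6) else "MISMATCH"
    print(f"{name:>12}: derived={value:.6g} reported={reported[name]:.6g} {status}")
```

The same check can be repeated for any other numeric variable in the report by swapping in its figures.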
Outstanding/Jul05
Real number (ℝ)
HIGH CORRELATION (×4), ZEROS
| Distinct | 21763 |
|---|---|
| Distinct (%) | 73.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46803.20327 |
| Minimum | -157264 |
|---|---|
| Maximum | 1664089 |
| Zeros | 2826 |
| Zeros (%) | 9.5% |
| Negative | 649 |
| Negative (%) | 2.2% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -157264 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2652 |
| median | 20035 |
| Q3 | 59830 |
| 95-th percentile | 186878 |
| Maximum | 1664089 |
| Range | 1821353 |
| Interquartile range (IQR) | 57178 |
Descriptive statistics
| Standard deviation | 69123.89211 |
|---|---|
| Coefficient of variation (CV) | 1.476905153 |
| Kurtosis | 20.14060006 |
| Mean | 46803.20327 |
| Median Absolute Deviation (MAD) | 19647 |
| Skewness | 3.106686615 |
| Sum | 1385421620 |
| Variance | 4778112460 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2826 | 9.5% |
| 390 | 273 | 0.9% |
| 780 | 72 | 0.2% |
| 316 | 62 | 0.2% |
| 326 | 61 | 0.2% |
| 396 | 47 | 0.2% |
| 2500 | 40 | 0.1% |
| 2400 | 39 | 0.1% |
| 416 | 29 | 0.1% |
| 200 | 26 | 0.1% |
| Other values (21753) | 26126 |
| Value | Count | Frequency (%) |
|---|---|---|
| -157264 | 1 | |
| -61506 | 1 | |
| -46127 | 1 | |
| -34041 | 1 | |
| -25443 | 1 | |
| -24702 | 1 | |
| -20320 | 1 | |
| -17706 | 1 | |
| -15910 | 1 | |
| -15641 | 1 |
| Value | Count | Frequency (%) |
|---|---|---|
| 1664089 | 1 | |
| 855086 | 1 | |
| 693131 | 1 | |
| 689643 | 1 | |
| 689627 | 1 | |
| 632041 | 1 | |
| 597415 | 1 | |
| 578971 | 1 | |
| 577957 | 1 | |
| 577015 | 1 |
Outstanding/Jun05
Real number (ℝ)
HIGH CORRELATION (×4), ZEROS
| Distinct | 21303 |
|---|---|
| Distinct (%) | 72.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43122.5542 |
| Minimum | -170000 |
|---|---|
| Maximum | 891586 |
| Zeros | 3143 |
| Zeros (%) | 10.6% |
| Negative | 667 |
| Negative (%) | 2.3% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -170000 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2329 |
| median | 19005 |
| Q3 | 54271 |
| 95-th percentile | 174074 |
| Maximum | 891586 |
| Range | 1061586 |
| Interquartile range (IQR) | 51942 |
Descriptive statistics
| Standard deviation | 64196.38391 |
|---|---|
| Coefficient of variation (CV) | 1.488696231 |
| Kurtosis | 11.36854578 |
| Mean | 43122.5542 |
| Median Absolute Deviation (MAD) | 18606 |
| Skewness | 2.82838955 |
| Sum | 1276470727 |
| Variance | 4121175708 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3143 | 10.6% |
| 390 | 245 | 0.8% |
| 780 | 99 | 0.3% |
| 316 | 68 | 0.2% |
| 326 | 61 | 0.2% |
| 396 | 43 | 0.1% |
| 2400 | 39 | 0.1% |
| 150 | 39 | 0.1% |
| 2500 | 34 | 0.1% |
| 416 | 33 | 0.1% |
| Other values (21293) | 25797 |
| Value | Count | Frequency (%) |
|---|---|---|
| -170000 | 1 | |
| -81334 | 1 | |
| -65167 | 1 | |
| -50616 | 1 | |
| -46627 | 1 | |
| -34503 | 1 | |
| -27490 | 1 | |
| -24303 | 1 | |
| -22108 | 1 | |
| -20320 | 1 |
| Value | Count | Frequency (%) |
|---|---|---|
| 891586 | 1 | |
| 706864 | 1 | |
| 628699 | 1 | |
| 616836 | 1 | |
| 572805 | 1 | |
| 569034 | 1 | |
| 565669 | 1 | |
| 563543 | 1 | |
| 548020 | 1 | |
| 542653 | 1 |
Outstanding/May05
Real number (ℝ)
HIGH CORRELATION (×4), ZEROS
| Distinct | 20783 |
|---|---|
| Distinct (%) | 70.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40235.54518 |
| Minimum | -81334 |
|---|---|
| Maximum | 927171 |
| Zeros | 3433 |
| Zeros (%) | 11.6% |
| Negative | 649 |
| Negative (%) | 2.2% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -81334 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1780 |
| median | 18091 |
| Q3 | 50072 |
| 95-th percentile | 165725 |
| Maximum | 927171 |
| Range | 1008505 |
| Interquartile range (IQR) | 48292 |
Descriptive statistics
| Standard deviation | 60699.34488 |
|---|---|
| Coefficient of variation (CV) | 1.508600035 |
| Kurtosis | 12.35959093 |
| Mean | 40235.54518 |
| Median Absolute Deviation (MAD) | 17673 |
| Skewness | 2.880165904 |
| Sum | 1191012373 |
| Variance | 3684410469 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3433 | 11.6% |
| 390 | 234 | 0.8% |
| 780 | 91 | 0.3% |
| 316 | 79 | 0.3% |
| 326 | 61 | 0.2% |
| 150 | 58 | 0.2% |
| 396 | 45 | 0.2% |
| 2400 | 39 | 0.1% |
| 2500 | 37 | 0.1% |
| 416 | 36 | 0.1% |
| Other values (20773) | 25488 |
| Value | Count | Frequency (%) |
|---|---|---|
| -81334 | 1 | |
| -61372 | 1 | |
| -53007 | 1 | |
| -46627 | 1 | |
| -37594 | 1 | |
| -36156 | 1 | |
| -30481 | 1 | |
| -28335 | 1 | |
| -23003 | 1 | |
| -20753 | 1 |
| Value | Count | Frequency (%) |
|---|---|---|
| 927171 | 1 | |
| 823540 | 1 | |
| 587067 | 1 | |
| 551702 | 1 | |
| 547880 | 1 | |
| 530672 | 1 | |
| 524315 | 1 | |
| 516139 | 1 | |
| 514114 | 1 | |
| 508213 | 1 |
Outstanding/Apr05
Real number (ℝ)
HIGH CORRELATION (×4), ZEROS
| Distinct | 20396 |
|---|---|
| Distinct (%) | 68.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38858.44982 |
| Minimum | -339603 |
|---|---|
| Maximum | 961664 |
| Zeros | 3929 |
| Zeros (%) | 13.3% |
| Negative | 679 |
| Negative (%) | 2.3% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -339603 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1278 |
| median | 17118 |
| Q3 | 49121 |
| 95-th percentile | 161912 |
| Maximum | 961664 |
| Range | 1301267 |
| Interquartile range (IQR) | 47843 |
Descriptive statistics
| Standard deviation | 59519.89304 |
|---|---|
| Coefficient of variation (CV) | 1.531710434 |
| Kurtosis | 12.3451577 |
| Mean | 38858.44982 |
| Median Absolute Deviation (MAD) | 16793 |
| Skewness | 2.852904777 |
| Sum | 1150248973 |
| Variance | 3542617668 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3929 | 13.3% |
| 390 | 204 | 0.7% |
| 780 | 86 | 0.3% |
| 150 | 77 | 0.3% |
| 316 | 77 | 0.3% |
| 326 | 54 | 0.2% |
| 396 | 45 | 0.2% |
| 416 | 36 | 0.1% |
| -18 | 33 | 0.1% |
| 2400 | 32 | 0.1% |
| Other values (20386) | 25028 |
| Value | Count | Frequency (%) |
|---|---|---|
| -339603 | 1 | |
| -209051 | 1 | |
| -150953 | 1 | |
| -94625 | 1 | |
| -73895 | 1 | |
| -57060 | 1 | |
| -51443 | 1 | |
| -51183 | 1 | |
| -46627 | 1 | |
| -45734 | 1 |
| Value | Count | Frequency (%) |
|---|---|---|
| 961664 | 1 | |
| 699944 | 1 | |
| 568638 | 1 | |
| 527711 | 1 | |
| 527566 | 1 | |
| 514975 | 1 | |
| 513798 | 1 | |
| 511905 | 1 | |
| 501370 | 1 | |
| 499100 | 1 |
Paid/Sept05
| Distinct | 7862 |
|---|---|
| Distinct (%) | 26.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5649.560319 |
| Minimum | 0 |
|---|---|
| Maximum | 873552 |
| Zeros | 5192 |
| Zeros (%) | 17.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1000 |
| median | 2100 |
| Q3 | 5005 |
| 95-th percentile | 18393 |
| Maximum | 873552 |
| Range | 873552 |
| Interquartile range (IQR) | 4005 |
Descriptive statistics
| Standard deviation | 16568.26494 |
|---|---|
| Coefficient of variation (CV) | 2.932664492 |
| Kurtosis | 419.7892648 |
| Mean | 5649.560319 |
| Median Absolute Deviation (MAD) | 1931 |
| Skewness | 14.77258405 |
| Sum | 167232635 |
| Variance | 274507403.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5192 | 17.5% |
| 2000 | 1340 | 4.5% |
| 3000 | 880 | 3.0% |
| 5000 | 689 | 2.3% |
| 1500 | 501 | 1.7% |
| 4000 | 416 | 1.4% |
| 10000 | 396 | 1.3% |
| 1000 | 360 | 1.2% |
| 2500 | 297 | 1.0% |
| 6000 | 291 | 1.0% |
| Other values (7852) | 19239 |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5192 | 17.5% |
| 1 | 9 | < 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 15 | 0.1% |
| 4 | 17 | 0.1% |
| 5 | 12 | < 0.1% |
| 6 | 15 | 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 873552 | 1 | |
| 505000 | 1 | |
| 493358 | 1 | |
| 423903 | 1 | |
| 405016 | 1 | |
| 368199 | 1 | |
| 323014 | 1 | |
| 304815 | 1 | |
| 302000 | 1 | |
| 300039 | 1 |
Paid/Aug05
| Distinct | 7814 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5894.788386 |
| Minimum | 0 |
|---|---|
| Maximum | 1684259 |
| Zeros | 5334 |
| Zeros (%) | 18.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 825 |
| median | 2007 |
| Q3 | 5000 |
| 95-th percentile | 18934 |
| Maximum | 1684259 |
| Range | 1684259 |
| Interquartile range (IQR) | 4175 |
Descriptive statistics
| Standard deviation | 23089.19362 |
|---|---|
| Coefficient of variation (CV) | 3.916882526 |
| Kurtosis | 1649.750282 |
| Mean | 5894.788386 |
| Median Absolute Deviation (MAD) | 1993 |
| Skewness | 30.62926199 |
| Sum | 174491631 |
| Variance | 533110862.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5334 | 18.0% |
| 2000 | 1274 | 4.3% |
| 3000 | 848 | 2.9% |
| 5000 | 707 | 2.4% |
| 1000 | 585 | 2.0% |
| 1500 | 516 | 1.7% |
| 4000 | 402 | 1.4% |
| 10000 | 313 | 1.1% |
| 6000 | 281 | 0.9% |
| 2500 | 249 | 0.8% |
| Other values (7804) | 19092 |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5334 | 18.0% |
| 1 | 15 | 0.1% |
| 2 | 20 | 0.1% |
| 3 | 18 | 0.1% |
| 4 | 11 | < 0.1% |
| 5 | 25 | 0.1% |
| 6 | 8 | < 0.1% |
| 7 | 12 | < 0.1% |
| 8 | 9 | < 0.1% |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 1684259 | 1 | |
| 1227082 | 1 | |
| 1215471 | 1 | |
| 1024516 | 1 | |
| 580464 | 1 | |
| 415552 | 1 | |
| 401003 | 1 | |
| 388126 | 1 | |
| 385228 | 1 | |
| 384986 | 1 |
Paid/Jul05
| Distinct | 7431 |
|---|---|
| Distinct (%) | 25.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5198.415898 |
| Minimum | 0 |
|---|---|
| Maximum | 896040 |
| Zeros | 5891 |
| Zeros (%) | 19.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 390 |
| median | 1800 |
| Q3 | 4500 |
| 95-th percentile | 17337 |
| Maximum | 896040 |
| Range | 896040 |
| Interquartile range (IQR) | 4110 |
Descriptive statistics
| Standard deviation | 17580.91481 |
|---|---|
| Coefficient of variation (CV) | 3.381975423 |
| Kurtosis | 574.5439067 |
| Mean | 5198.415898 |
| Median Absolute Deviation (MAD) | 1793 |
| Skewness | 17.41906579 |
| Sum | 153878309 |
| Variance | 309088565.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5891 | 19.9% |
| 2000 | 1267 | 4.3% |
| 1000 | 1092 | 3.7% |
| 3000 | 862 | 2.9% |
| 5000 | 710 | 2.4% |
| 1500 | 484 | 1.6% |
| 4000 | 375 | 1.3% |
| 10000 | 312 | 1.1% |
| 1200 | 241 | 0.8% |
| 6000 | 238 | 0.8% |
| Other values (7421) | 18129 |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5891 | 19.9% |
| 1 | 13 | < 0.1% |
| 2 | 19 | 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 15 | 0.1% |
| 5 | 17 | 0.1% |
| 6 | 14 | < 0.1% |
| 7 | 18 | 0.1% |
| 8 | 10 | < 0.1% |
| 9 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 896040 | 1 | |
| 889043 | 1 | |
| 508229 | 1 | |
| 417588 | 1 | |
| 400972 | 1 | |
| 397092 | 1 | |
| 380478 | 1 | |
| 371718 | 1 | |
| 349395 | 1 | |
| 344261 | 1 |
Paid/Jun05
| Distinct | 6880 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4828.659268 |
| Minimum | 0 |
|---|---|
| Maximum | 621000 |
| Zeros | 6318 |
| Zeros (%) | 21.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 298 |
| median | 1500 |
| Q3 | 4014 |
| 95-th percentile | 16014 |
| Maximum | 621000 |
| Range | 621000 |
| Interquartile range (IQR) | 3716 |
Descriptive statistics
| Standard deviation | 15711.05799 |
|---|---|
| Coefficient of variation (CV) | 3.253710216 |
| Kurtosis | 277.6714652 |
| Mean | 4828.659268 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 12.93192018 |
| Sum | 142933143 |
| Variance | 246837343.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6318 | 21.3% |
| 1000 | 1381 | 4.7% |
| 2000 | 1197 | 4.0% |
| 3000 | 876 | 3.0% |
| 5000 | 805 | 2.7% |
| 1500 | 437 | 1.5% |
| 4000 | 393 | 1.3% |
| 10000 | 336 | 1.1% |
| 2500 | 255 | 0.9% |
| 500 | 253 | 0.9% |
| Other values (6870) | 17350 |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6318 | 21.3% |
| 1 | 21 | 0.1% |
| 2 | 22 | 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 20 | 0.1% |
| 5 | 12 | < 0.1% |
| 6 | 16 | 0.1% |
| 7 | 11 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 621000 | 1 | |
| 528897 | 1 | |
| 497000 | 1 | |
| 432130 | 1 | |
| 400046 | 1 | |
| 331788 | 1 | |
| 330982 | 1 | |
| 320008 | 1 | |
| 313094 | 1 | |
| 292962 | 1 |
Paid/May05
| Distinct | 6837 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4795.032735 |
| Minimum | 0 |
|---|---|
| Maximum | 426529 |
| Zeros | 6600 |
| Zeros (%) | 22.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 259 |
| median | 1500 |
| Q3 | 4042 |
| 95-th percentile | 16002 |
| Maximum | 426529 |
| Range | 426529 |
| Interquartile range (IQR) | 3783 |
Descriptive statistics
| Standard deviation | 15244.21715 |
|---|---|
| Coefficient of variation (CV) | 3.179168526 |
| Kurtosis | 182.477426 |
| Mean | 4795.032735 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 11.19205537 |
| Sum | 141937764 |
| Variance | 232386156.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6600 | 22.3% |
| 1000 | 1323 | 4.5% |
| 2000 | 1302 | 4.4% |
| 3000 | 940 | 3.2% |
| 5000 | 804 | 2.7% |
| 1500 | 419 | 1.4% |
| 4000 | 398 | 1.3% |
| 10000 | 340 | 1.1% |
| 500 | 245 | 0.8% |
| 6000 | 243 | 0.8% |
| Other values (6827) | 16987 |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6600 | 22.3% |
| 1 | 21 | 0.1% |
| 2 | 12 | < 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 12 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 7 | < 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 6 | < 0.1% |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 426529 | 1 | |
| 417990 | 1 | |
| 388071 | 1 | |
| 379267 | 1 | |
| 332000 | 1 | |
| 331788 | 1 | |
| 330982 | 1 | |
| 326889 | 1 | |
| 317077 | 1 | |
| 310135 | 1 |
Paid/Apr05
| Distinct | 6884 |
|---|---|
| Distinct (%) | 23.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5181.326374 |
| Minimum | 0 |
|---|---|
| Maximum | 528666 |
| Zeros | 7043 |
| Zeros (%) | 23.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 138 |
| median | 1500 |
| Q3 | 4000 |
| 95-th percentile | 17324 |
| Maximum | 528666 |
| Range | 528666 |
| Interquartile range (IQR) | 3862 |
Descriptive statistics
| Standard deviation | 17657.26074 |
|---|---|
| Coefficient of variation (CV) | 3.407864987 |
| Kurtosis | 172.8169309 |
| Mean | 5181.326374 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 10.81967246 |
| Sum | 153372442 |
| Variance | 311778856.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 7043 | 23.8% |
| 1000 | 1290 | 4.4% |
| 2000 | 1279 | 4.3% |
| 3000 | 907 | 3.1% |
| 5000 | 797 | 2.7% |
| 1500 | 434 | 1.5% |
| 4000 | 404 | 1.4% |
| 10000 | 355 | 1.2% |
| 500 | 246 | 0.8% |
| 6000 | 214 | 0.7% |
| Other values (6874) | 16632 |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 7043 | 23.8% |
| 1 | 20 | 0.1% |
| 2 | 9 | < 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 12 | < 0.1% |
| 5 | 6 | < 0.1% |
| 6 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 6 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 528666 | 1 | |
| 527143 | 1 | |
| 443001 | 1 | |
| 422000 | 1 | |
| 403500 | 1 | |
| 377000 | 1 | |
| 372495 | 1 | |
| 351282 | 1 | |
| 345293 | 1 | |
| 308000 | 1 |
Default
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.0 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 22996 | 77.7% |
| True | 6605 | 22.3% |
PayStats
| Distinct | 47 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.92047566 |
| Minimum | -6 |
|---|---|
| Maximum | 42 |
| Zeros | 2470 |
| Zeros (%) | 8.3% |
| Negative | 4405 |
| Negative (%) | 14.9% |
| Memory size | 231.4 KiB |
Quantile statistics
| Minimum | -6 |
|---|---|
| 5-th percentile | -6 |
| Q1 | 1 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 16 |
| Maximum | 42 |
| Range | 48 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 5.905153497 |
|---|---|
| Coefficient of variation (CV) | 1.200118425 |
| Kurtosis | 2.683194013 |
| Mean | 4.92047566 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.6948094664 |
| Sum | 145651 |
| Variance | 34.87083783 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 10023 | 33.9% |
| 0 | 2470 | 8.3% |
| -6 | 2072 | 7.0% |
| 8 | 1864 | 6.3% |
| 5 | 1321 | 4.5% |
| 2 | 1148 | 3.9% |
| 4 | 1056 | 3.6% |
| 3 | 1046 | 3.5% |
| 1 | 1045 | 3.5% |
| -3 | 859 | 2.9% |
| Other values (37) | 6697 |
| Value | Count | Frequency (%) |
|---|---|---|
| -6 | 2072 | 7.0% |
| -5 | 87 | 0.3% |
| -4 | 291 | 1.0% |
| -3 | 859 | 2.9% |
| -2 | 481 | 1.6% |
| -1 | 615 | 2.1% |
| 0 | 2470 | 8.3% |
| 1 | 1045 | 3.5% |
| 2 | 1148 | 3.9% |
| 3 | 1046 | 3.5% |
| Value | Count | Frequency (%) |
|---|---|---|
| 42 | 1 | < 0.1% |
| 39 | 19 | 0.1% |
| 38 | 20 | 0.1% |
| 37 | 9 | < 0.1% |
| 36 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 34 | 27 | 0.1% |
| 33 | 12 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 3 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only the sign of the slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
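As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations using only the Python standard library. The function name `pearson_r` is illustrative and is not part of the profiling tool; the sample values are the Outstanding/Sept05 and Outstanding/Aug05 entries of the first five rows of the dataset.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/(n-1) factors of the covariance and of the two standard deviations cancel.
    return cov / math.sqrt(var_x * var_y)

# Outstanding/Sept05 and Outstanding/Aug05 for the first five rows.
sept = [3913, 2682, 29239, 46990, 8617]
aug = [3102, 1725, 14027, 48233, 5670]

r = pearson_r(sept, aug)
# Shifting and rescaling one variable by a positive factor leaves r unchanged.
r_rescaled = pearson_r([0.001 * v + 5 for v in sept], aug)
print(round(r, 4), round(r_rescaled, 4))
```

Five rows are far too few to say anything about the full dataset; the sample only illustrates the arithmetic and the invariance property.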
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
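The sketch below follows that recipe literally: it ranks both variables and then applies the Pearson formula to the ranks. It assumes tied values share the average of their rank positions, which is a common convention; the helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but nonlinear in x

print(spearman_rho(x, y))  # the ranks agree exactly
print(pearson_r(x, y))     # below 1 because the relation is not linear
```

The cubic example shows the difference the text describes: the ordering is preserved perfectly, so ρ reaches its maximum while r does not.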
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
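The pair-counting definition above can be sketched directly. This is the plain tau-a form, in which tied pairs count as neither concordant nor discordant; library implementations often use a tie-adjusted variant instead (SciPy's `kendalltau`, for example, defaults to tau-b), so results can differ when the data contain ties. The function name `kendall_tau_a` is illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        # A pair is concordant when x and y move in the same direction.
        product = (x[i] - x[j]) * (y[i] - y[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # product == 0 means a tie in x or y: counted as neither.
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

tau = kendall_tau_a(x, y)
print(tau)  # (5 - 1) / 6
```

Counting all pairs is quadratic in the number of observations, which is fine for a sketch but slow on a dataset of this size.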
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Unnamed: 0 | Credit Limit | Sex | Education | Marital Status | Age | PayStat/Sept05 | PayStat/Aug05 | PayStat/Jul05 | PayStat/Jun05 | PayStat/May05 | PayStat/Apr05 | Outstanding/Sept05 | Outstanding/Aug05 | Outstanding/Jul05 | Outstanding/Jun05 | Outstanding/May05 | Outstanding/Apr05 | Paid/Sept05 | Paid/Aug05 | Paid/Jul05 | Paid/Jun05 | Paid/May05 | Paid/Apr05 | Default | PayStats |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 20000 | F | BSc | Married | 24 | 2 | 2 | -1 | -1 | -2 | -2 | 3913 | 3102 | 689 | 0 | 0 | 0 | 0 | 689 | 0 | 0 | 0 | 0 | True | 4 |
| 1 | 2 | 120000 | F | BSc | Single | 26 | -1 | 2 | 0 | 0 | 0 | 2 | 2682 | 1725 | 2682 | 3272 | 3455 | 3261 | 0 | 1000 | 1000 | 1000 | 0 | 2000 | True | 9 |
| 2 | 3 | 90000 | F | BSc | Single | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 29239 | 14027 | 13559 | 14331 | 14948 | 15549 | 1518 | 1500 | 1000 | 1000 | 1000 | 5000 | False | 6 |
| 3 | 4 | 50000 | F | BSc | Married | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 46990 | 48233 | 49291 | 28314 | 28959 | 29547 | 2000 | 2019 | 1200 | 1100 | 1069 | 1000 | False | 6 |
| 4 | 5 | 50000 | M | BSc | Married | 57 | -1 | 0 | -1 | 0 | 0 | 0 | 8617 | 5670 | 35835 | 20940 | 19146 | 19131 | 2000 | 36681 | 10000 | 9000 | 689 | 679 | False | 4 |
| 5 | 6 | 50000 | M | MSc or PHd | Single | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 64400 | 57069 | 57608 | 19394 | 19619 | 20024 | 2500 | 1815 | 657 | 1000 | 1000 | 800 | False | 6 |
| 6 | 7 | 500000 | M | MSc or PHd | Single | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 367965 | 412023 | 445007 | 542653 | 483003 | 473944 | 55000 | 40000 | 38000 | 20239 | 13750 | 13770 | False | 6 |
| 7 | 8 | 100000 | F | BSc | Single | 23 | 0 | -1 | -1 | 0 | 0 | -1 | 11876 | 380 | 601 | 221 | -159 | 567 | 380 | 601 | 0 | 581 | 1687 | 1542 | False | 3 |
| 8 | 9 | 140000 | F | High School Diploma | Married | 28 | 0 | 0 | 2 | 0 | 0 | 0 | 11285 | 14096 | 12108 | 12211 | 11793 | 3719 | 3329 | 0 | 432 | 1000 | 1000 | 1000 | False | 8 |
| 9 | 10 | 20000 | M | High School Diploma | Single | 35 | -2 | -2 | -2 | -2 | -1 | -1 | 0 | 0 | 0 | 0 | 13007 | 13912 | 0 | 0 | 0 | 13007 | 1122 | 0 | False | -4 |
Last rows
| | Unnamed: 0 | Credit Limit | Sex | Education | Marital Status | Age | PayStat/Sept05 | PayStat/Aug05 | PayStat/Jul05 | PayStat/Jun05 | PayStat/May05 | PayStat/Apr05 | Outstanding/Sept05 | Outstanding/Aug05 | Outstanding/Jul05 | Outstanding/Jun05 | Outstanding/May05 | Outstanding/Apr05 | Paid/Sept05 | Paid/Aug05 | Paid/Jul05 | Paid/Jun05 | Paid/May05 | Paid/Apr05 | Default | PayStats |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29591 | 29991 | 140000 | M | BSc | Married | 41 | 0 | 0 | 0 | 0 | 0 | 0 | 138325 | 137142 | 139110 | 138262 | 49675 | 46121 | 6000 | 7000 | 4228 | 1505 | 2000 | 2000 | False | 6 |
| 29592 | 29992 | 210000 | M | BSc | Married | 34 | 3 | 2 | 2 | 2 | 2 | 2 | 2500 | 2500 | 2500 | 2500 | 2500 | 2500 | 0 | 0 | 0 | 0 | 0 | 0 | True | 19 |
| 29593 | 29993 | 10000 | M | High School Diploma | Married | 43 | 0 | 0 | 0 | -2 | -2 | -2 | 8802 | 10400 | 0 | 0 | 0 | 0 | 2000 | 0 | 0 | 0 | 0 | 0 | False | 0 |
| 29594 | 29994 | 100000 | M | MSc or PHd | Single | 38 | 0 | -1 | -1 | 0 | 0 | 0 | 3042 | 1427 | 102996 | 70626 | 69473 | 55004 | 2000 | 111784 | 4000 | 3000 | 2000 | 2000 | False | 4 |
| 29595 | 29995 | 80000 | M | BSc | Single | 34 | 2 | 2 | 2 | 2 | 2 | 2 | 72557 | 77708 | 79384 | 77519 | 82607 | 81158 | 7000 | 3500 | 0 | 7000 | 0 | 4000 | True | 18 |
| 29596 | 29996 | 220000 | M | High School Diploma | Married | 39 | 0 | 0 | 0 | 0 | 0 | 0 | 188948 | 192815 | 208365 | 88004 | 31237 | 15980 | 8500 | 20000 | 5003 | 3047 | 5000 | 1000 | False | 6 |
| 29597 | 29997 | 150000 | M | High School Diploma | Single | 43 | -1 | -1 | -1 | -1 | 0 | 0 | 1683 | 1828 | 3502 | 8979 | 5190 | 0 | 1837 | 3526 | 8998 | 129 | 0 | 0 | False | 2 |
| 29598 | 29998 | 30000 | M | BSc | Single | 37 | 4 | 3 | 2 | -1 | 0 | 0 | 3565 | 3356 | 2758 | 20878 | 20582 | 19357 | 0 | 0 | 22000 | 4200 | 2000 | 3100 | True | 14 |
| 29599 | 29999 | 80000 | M | High School Diploma | Married | 41 | 1 | -1 | 0 | 0 | 0 | -1 | -1645 | 78379 | 76304 | 52774 | 11855 | 48944 | 85900 | 3409 | 1178 | 1926 | 52964 | 1804 | True | 5 |
| 29600 | 30000 | 50000 | M | BSc | Married | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 47929 | 48905 | 49764 | 36535 | 32428 | 15313 | 2078 | 1800 | 1430 | 1000 | 1000 | 1000 | True | 6 |